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Abstract 

Background: Online psychiatric texts are natural language texts expressing depressive problems, published by 
Internet users via community-based web services such as web forums, message boards and blogs. Understanding 
the cause-effect relations embedded in these psychiatric texts can provide insight into the authors' problems, thus 
increasing the effectiveness of online psychiatric services. 

Methods: Previous studies have proposed the use of word pairs extracted from a set of sentence pairs to identify 
cause-effect relations between sentences. A word pair is made up of two words, with one coming from the cause 
text span and the other from the effect text span. Analysis of the relationship between these words can be used to 
capture individual word associations between cause and effect sentences. For instance, (broke up, life) and 
(boyfriend, meaningless) are two word pairs extracted from the sentence pair: "I broke up with my boyfriend. Life is 
now meaningless to me". The major limitation of word pairs is that individual words in sentences usually cannot 
reflect the exact meaning of the cause and effect events, and thus may produce semantically incomplete word 
pairs, as the previous examples show. Therefore, this study proposes the use of inter-sentential language patterns 
such as «rbroke up, boyfriend>, <life, meaningless» to detect causality between sentences. The inter-sentential 
language patterns can capture associations among multiple words within and between sentences, thus can provide 
more precise information than word pairs. To acquire inter-sentential language patterns, we develop a text 
mining framework by extending the classical association rule mining algorithm such that it can discover frequently 
co-occurring patterns across the sentence boundary. 

Results: Performance was evaluated on a corpus of texts collected from PsychPark (http://www.psychpark.org), a 
virtual psychiatric clinic maintained by a group of volunteer professionals from the Taiwan Association of Mental 
Health Informatics. Experimental results show that the use of inter-sentential language patterns outperformed the 
use of word pairs proposed in previous studies. 

Conclusions: This study demonstrates the acquisition of inter-sentential language patterns for causality detection 
from online psychiatric texts. Such semantically more complete and precise features can improve causality 
detection performance. 
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Background 

Online community-based services such as web forums, 
message boards, and blogs provide an efficient and ef- 
fective way for sharing information and gathering know- 
ledge [1-3]. In the field of mental health care, these 
services allow individuals to describe their life stresses 
and depressive problems to other Internet users or 
health professionals who can then make recommenda- 
tions to help the subject developing the knowledge 
needed to seek appropriate care. Examples of these web- 
sites include Depression Forums a , PsychPark b , SA-UK C , 
WebMD d , and Yahoo! Answers 6 . This paper refers to this 
type of online post as online psychiatric texts, and their 
major characteristic is that they are in the form of nat- 
ural language texts, featuring many cause-effect relations 
between sentences. Some examples of causality sen- 
tences are presented below: 

(El) / couldn't sleep for several days because my boss 
cut my salary. 

(E2) / failed again. I felt very upset. 

(E3) / broke up with my boyfriend. Life now is 

meaningless to me. 

These examples indicate three depressive problems 
caused by negative life events experienced by the 
speaker. Awareness of such cause-effect relations be- 
tween sentences can improve our understanding of 
users' problems and make online psychiatric services 
more effective. For instance, systems capable of identify- 
ing causality from online forum posts could assist health 
professionals in capturing users' background information 
more quickly, thus decreasing response time. Addition- 
ally, a dialog system could generate supportive responses 
if it could understand depressive problems and their 
associated reasons embedded in users' input. Recent 
studies also show that causality is an important concept 
in biomedical informatics [4], and identifying cause- 
effect relations as well as other semantic relations could 
improve the effectiveness of many applications such as 
question answering [5-7], biomedical text mining [8-10], 
future event prediction [11], information retrieval [12], 
and e-learning [13]. Therefore, this paper proposes a text 
mining framework to detect cause-effect relations be- 
tween sentences from online psychiatric texts. 

Causality (or a cause-effect relation) is a relation be- 
tween two events: cause and effect. In natural language 
texts, cause-effect relations can generally be categorized 
as explicit and implicit depending on whether or not a 
discourse connective (e.g., "because", "therefore") is 
found between the cause and effect text spans [14-16]. 
For instance, the example sentence El contains an expli- 
cit cause-effect relation due to the presence of the dis- 
course connective "because" which signals the relation. 



Conversely, both E2 and E3 lack a discourse connective 
and thus the cause-effect relation between the sentences 
is implicit. Traditional approaches to identifying explicit 
cause-effect relations have focused on mining useful dis- 
course connectives that can trigger the cause-effect rela- 
tion. Wu et al. [17] manually collected a set of discourse 
connectives to identify cause-effect relations from psy- 
chiatric consultation records. Ramesh and Yu [18] pro- 
posed the use of a supervised machine learning method 
called conditional random fields (CRFs) to automatically 
identify discourse connectives in biomedical texts. Inui 
et al. [19] used a discourse connective "tame" to acquire 
causal knowledge from Japanese newspaper articles. Al- 
though discourse connectives are useful features for 
identifying causality, the difficulty inherent in collecting 
a complete set of discourse connectives may result in 
this approach failing to identify the cause-effect relations 
triggered by unknown discourse connectives. In 
addition, it may also fail to identify implicit cause-effect 
relations that lack an explicit discourse connective be- 
tween the sentences. Accordingly, other useful features 
and algorithms have been investigated to identify impli- 
cit causality within [20,21] and between sentences 
[22,23]. Efforts to identify causality within sentences 
have investigated features that consider sentence struc- 
ture. Rink et al. [20] proposed the use of textual graph 
patterns obtained from parse trees to determine whether 
two events from the same sentence have a causal rela- 
tion. Mulkar-Mehta et al. [21] introduced a theory of 
granularity to identify sentences containing causal rela- 
tions. Features across the sentence boundary could be 
useful in identifying causality between sentences because 
such features can capture feature relationships between 
sentences. For instance, word pairs in which one word 
comes from the cause text span and the other comes 
from the effect text span have been demonstrated to be 
useful features for discovering implicit causality between 
sentences [22,23] because they can capture individual 
word associations between cause and effect sentences. In 
the E2 sample sentence pair, the word pair (fail, upset) 
helps identify the implicit cause-effect relation that holds 
between the two sentences. 

However, within the sentences, individual words usu- 
ally cannot reflect the exact meaning of the cause and 
effect events which, taking E3 as an example, may pro- 
duce semantically incomplete word pairs such as (broke 
up, life), (broke up, meaningless), (boyfriend, life), and 
(boyfriend, meaningless). In fact, many cause and effect 
events can be characterized by language patterns, i.e., 
meaningful combinations of words. For instance, in E3, 
the first sentence (cause) can be characterized by a lan- 
guage pattern < broke up, boyfriend >, and the second 
sentence (effect) can be characterized by < life, mean- 
ingless >. Combining these two intra-sentential language 
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patterns constitutes a more semantically complete inter- 
sentential language pattern <C broke up, boyfriend>, 
<life, meaningless ». Such inter-sentential language 
patterns can provide more precise information to im- 
prove the performance of causality detection because 
they can capture the associations of multiple words 
within and between sentences. Therefore, this study 
develops a text mining framework by extending the clas- 
sical association rule mining algorithm [24-28] such that 
it can mine inter-sentential language patterns by associ- 
ating frequently co-occurred patterns across the sen- 
tence boundary. The discovered patterns are then 
incorporated into a probabilistic model to detect causal- 
ity between sentences. 

The rest of this paper is organized as follows. We first 
describe the framework for inter-sentential language pat- 
tern mining and causality detection. We then summarize 
the experimental results of and present conclusions. 

Methods 

Figure 1(a) illustrates the framework of inter-sentential 
language pattern mining and causality detection. The 
online psychiatric texts are a collection of forum posts 
collected from PsychPark (http://www.psychpark.org), a 
virtual psychiatric clinic maintained by a group of volun- 
teer professionals belonging to the Taiwan Association 
of Mental Health Informatics [29,30]. A set of discourse 
connectives based on the results of previous studies 
[16,17] was created to select causality sentences from 
the online psychiatric texts. These causality sentences 



are then split into cause and effect text spans by remov- 
ing the discourse connectives between them. For in- 
stance, in Figure 1(b), the sample causality sentences can 
be split by removing the discourse connective "so". Next, 
the sets of cause and effect text spans are processed by 
the algorithm in two steps: intra- sentential and inter- 
sentential language pattern mining. Intra-sentential 
language pattern mining is used to discover language 
patterns of frequently co-occurring words within the 
cause and effect text spans. Once the intra-sentential 
language patterns are discovered, the frequently co- 
occurred patterns between the cause and effect text 
spans are then combined to form a set of inter- 
sentential language patterns. As indicated in Figure 1 
(b), two intra-sentential language patterns < broke up, 
boyfriend > and < life, meaningless > are discovered 
from their respective cause and effect text spans, and 
they constitute an inter-sentential language pattern 
« broke up, boyfriend>, <life, meaningless ». Finally, 
the acquired inter-sentential language patterns are 
used as features to detect causality between sentences. 

The following subsections describe how the proposed 
mining algorithm extends the classical association rule 
mining to acquire both intra- and inter-sentential lan- 
guage patterns. 

Intra-sentential language pattern mining 

This section describes two methods for generating intra- 
sentential language patterns: extended association rule 
mining and sentence parsing. 
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Figure 1 (a) Framework of inter-sentential language patterns mining and causality detection, (b) Example of inter-sentential language 
patterns mining. 
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Method 1: extended association rule mining 

For the mining of intra-sentential language patterns, ra- 
ther than mining frequent item sets in the classical asso- 
ciation rule mining problem, we attempt to mine 
frequent word sets (frequently co-occurred words) in the 
sets of cause and effect text spans. For this purpose, we 
adopted a modified version of the Apriori algorithm 
[24,31,32]. The basic concept behind the Apriori algo- 
rithm is the recursive identification of frequent word 
sets from which intra-sentential language patterns are 
then generated. For simplicity, only nouns and verbs are 
considered in language pattern generation. The detailed 
procedure is described as follows. 

Find frequent word sets within cause and effect 
text spans 

A word set is frequent if it possesses a minimum level of 
support. The support of a word set is defined as the number 
of times the word set occurs in the set of cause (or effect) 
text spans. For instance, the support of a two-word set \w b 
w ; ) denotes the number of times the word pair {w i} wj) occurs 



in the set of cause (or effect) text spans. The frequent k- 
word sets are discovered from (/c-l)-word sets. First, the 
support of each word (i.e., the word frequency) was counted 
from the set of cause (or effect) text spans. The set of fre- 
quent one-word sets, denoted as L 1} was then generated by 
choosing the words with a minimum support level. To cal- 
culate L k , the following two-step process is performed itera- 
tively until no more frequent /c-word sets are found. 

• Join step: A set of candidate /c-word sets, denoted as 
Ck, is first generated by merging frequent word sets 
of L k _ 1} in which only the word sets with identical 
first (k-2) words can be merged. 

• Prune step: The support of each candidate word set 
in C k is then counted to determine which candidate 
word sets are frequent. Finally, the candidate word 
sets with a support count greater than or equal to 
the minimum support form L k . The candidate word 
sets with infrequent subsets were eliminated. 
Figure 2 shows an example of generating L k . The 
maximum value of L k is determined when no more 
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Figure 2 Generating intra-sentential language patterns for both cause and effect events. 
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frequent /c-word sets are found in the generation 
process. 

Generate intra-sentential language patterns from 
frequent word sets 

Once the frequent word sets have been identified, the 
intra-sentential language patterns can be generated via a 
confidence measure. Let Ipi —< w±, . . . , w k > denotes an 
intra-sentential language pattern of k words. The confi- 
dence of Ipi is defined as the mutual information of the 
k words [33-35], as shown below: 



Conf(lpi) = MI(w u . . . w k ) 
= P(w u ...w k )log 



P(w u ...w k ) 

nti^o 



(1) 



where P(w±, . . . w k ) denotes the probability of the k 
words co-occurring in the set of cause (or effect) text 
spans, and P(wi) denotes the probability of a single word 
occurring in the set of cause (or effect) text spans. Ac- 
cordingly, for every frequent word set in Z, k , an intra- 
sentential language pattern is generated if the mutual 
information of the k words is greater than or equal to 
a minimum confidence. The resulting intra-sentential 
language patterns are those with a minimum confi- 
dence level. Figure 2 shows an example of generating 
intra-sentential language patterns from L k . 

Method 2: sentence parsing 

In addition to the extended association rule mining pre- 
sented above, sentence parsing that considers sentence 
structure can also be used to discover word dependen- 
cies in sentences. Therefore, this study uses a parser 
developed by Academia Sinica, Taiwan [36] to generate 
intra-sentential language patterns by deriving word pairs 
with proper dependencies from the parse trees of both 
cause and effect text spans. Figure 3 shows the parse 
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NP 



Head possessor Head 
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my boss cut 
Figure 3 Example of a parse tree. 



my 



salary 



tree output for the sample sentence: My boss cut my 
salary. 

The parser assigns a phrase label (e.g., NP, VP, PP, etc.) 
and a semantic label (e.g., Head, possessor, theme, etc.) 
to each constituent in the sentences. The dependencies 
of each word and its head are then considered as the 
intra-sentential language patterns. For example, in 
Figure 3, the intra-sentential language patterns for the 
sample sentences include (my, boss), (my, salary), (boss, 
cut), and (salary, cut). 

Inter-sentential language pattern mining 

An mter-sentential language pattern is composed of at 
least one intra- sentential language pattern for cause 
events and one for effect events. Therefore, once the 
intra-sentential language patterns for cause and effect 
events are generated using each of the abovementioned 
methods, the next step is to generate inter-sentential 
language patterns by finding frequently co-occurring 
patterns between the cause and effect text spans. This 
can be accomplished by repeating the same procedure 
presented above for extended association rule mining to 
find frequent pattern sets which are then used to gener- 
ate inter-sentential language patterns. 

Find frequent pattern sets between cause and effect 
text spans 

The procedure for finding frequent pattern sets only dif- 
fers from that of finding frequent word sets in terms of 
the definition of the support measure. In finding fre- 
quent word sets, the support of a word set is defined as 
the number of times the word set occurs in the set of 
cause (or effect) text spans. In this step, a pattern set is 
composed of at least one pattern from cause events and 
one from effect events. Therefore, the support of a pat- 
tern set is defined as the number of times the pattern set 
occurs between the sets of cause and effect text spans. 
For instance, suppose a two-pattern set {lpulpj\ where lp t 
and Ipj respectively denote an intra-sentential language 
pattern for the cause and effect events. The support of 
this two-pattern set is the number of times, lp t and Ipj 
co-occur between the sets of cause and effect text spans. 
Therefore, in searching for frequent pattern sets, all 
combinations of the intra-sentential language patterns 
for the cause and effect events are considered as candi- 
date pattern sets. The join and prune steps presented in 
the previous section can then be repeated to determine 
frequent pattern sets from all possible pattern combina- 
tions. Figure 4 shows an example. 

Generate inter-sentential language patterns from 
frequent pattern sets 

Similar to the procedure for generating intra-sentential 
language patterns, this step requires a confidence 
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Figure 4 Generating inter-sentential language patterns for cause-effect relations. 



measure to generate inter-sentential language patterns 
from frequent pattern sets. In generating intra-sentential 
language patterns, the confidence score is used to 
measure the mutual information of the words in a fre- 
quent word set. In this step, the confidence score is used 
to measure the mutual information of the patterns in a 
frequent pattern set. Let islpi —< Ipi, . . . ,lpk > denotes 
an inter-sentential language pattern of k patterns. The 
confidence of islpi is defined as the mutual information 
of the k words, as shown below: 



on the detection of implicit cause-effect relations. Other 
studies have also demonstrated the use of surface text 
patterns for relation extraction [37,38]. Given a sentence 
pair (s i} sj) without any discourse connective between s t 
and sp the goal is to classify the sentence pair into caus- 
ality or non-causality, as shown below: 



argmax P (c^ \ S( , sj ) 

Ck 

argmax P(si,Sj\ck)P(ck), 



(3) 



Conf(islpi) = MI(lpi, Ipk) 

= P(lp!,...,lp k )l0g 



P(lp U ...,lp k 



(2) 



where P(lpi, • • • , Ipk) denotes the probability of the k pat- 
terns co-occurring between the sets of cause and effect 
text spans, and P{lpj) denotes the probability of a pattern 
occurring in the set of cause (or effect) text spans. The 
resulting inter-sentential language patterns are those 
with a minimum confidence score. Figure 4 shows an 
example. 

Causality detection 

This section describes the use of inter-sentential language 
patterns to detect causality between sentences, focusing 



where c* is the prediction output, representing causality 
(c/<=i) or non-causality {c/ <=0 ). Before prediction, the input 
sentence pair (s if sj) is first transformed into feature rep- 
resentation. This study uses both inter-sentential lan- 
guage patterns and previously proposed word pairs as 
features. As each sentence pair is transformed into pat- 
tern representation, it is represented by a single or mul- 
tiple inter-sentential language patterns depending on 
the number of patterns the sentence pair matched 
in the set of discovered inter-sentential language 
patterns. Therefore, a sentence pair containing n 
inter-sentential language patterns can be formally 
represented as = ISLP SiSj = {islpi, • • • Jslp n }. In 

the word-pair representation, each sentence pair is 
represented by a set of word pairs, denoted as 
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these two features, Eq. (3) can be re-written as 



s hSj ^ { (wf, Wj) \wi e s h Wj e sj}. By using 



Table 1 Statistics of experimental data 



c* = argmaxP(lSLP s . Sp WP s . Sj \c k )P(c k ). 



(4) 



where ISLP SiSj and WP ShS . represent the feature sets of 
inter-sentential language patterns and word pairs of the in- 
put sentence pair ($/,$/), respectively. Assume that ISLP S . S . 
and WP ShS . are independent. Eq. (4) can be re-written as 



c* = ar^^P(/5IP s . s J^)P(W s . S; .|^)P(^). 



(5) 



Assuming again that the elements in both ISLP ShS . and 
WP ShS . are independent, then 



argmax^^. =i P{islpi\c k )Y\ 
xP((wi,wj)\c k )P(c k ). 



(6) 



where P(islpi\c k ) and P{{wi, Wj) \c k ) denote the respect- 
ive probabilities of an inter-sentential language pattern 
and a word pair occurred in the causality or non- 
causality class, and P(c k ) denotes the probability of the 
causality or non-causality class. These probabilities can 
be estimated from the training data: 



P{islpi\c k ) = 
P((w h wj)\c k 



N(islpj,c k ) 
N{c k ) ' 

N(wi,Wj,c k ) 



N(c k ) 



P(ck) 



N(c k ) 
N ' 



(7) 
(8) 
(9) 



where N(islpi, c k ) and N(wt, Wj, c k ) denote the respective 
frequency counts of an inter-sentential language pattern 
and a word pair occurring in the causality or non-causality 
class, N(c k ) denotes the number of causality or non- 
causality sentences in the training data, and N denotes the 
total number of sentences in the training data. 

Results and Discussion 

This section presents the experimental results for causal- 
ity detection. We first explain the experimental setup, 
including experiment data, features used for causality 
detection, and evaluation metrics. The selection of opti- 
mal parameter settings for inter-sentential language pat- 
tern mining is then described, followed by the evaluation 
results of causality detection with different features. 



Data sets 



Training set Validation set Test set 



Number of causality 2,835 236 
sentence pairs 

Number of non-causality 5,200 245 
sentence pairs 

Total 8,035 481 



472 
728 
1,200 



Experimental setup 

• Data: A total of 9716 sentence pairs were collected 
from PsychPark [29,30], from which 8035, 481, and 
1200 sentence pairs were randomly selected as the 
training set, development set, and test set, 
respectively. For each data set, a set of discourse 
connectives collected based on the results of 
previous studies [16,17], were used to select 
causality sentence pairs. The statistics of the data 
sets are presented in Table 1. The training set was 
used to generate the inter-sentential language 
patterns and word pairs. The validation set was used 
to select the optimal value of the parameters used in 
inter-sentential language pattern mining. The test 
set was used to evaluate the performance of 
causality detection. 

• Features used for causality detection: This 
experiment used word pairs (WP) and inter- 
sentential language patterns (ISLP) as features to 
detect causality between sentences. For ISLP, we 
used ISLP ARM and ISLP parsing to denote the sets of 
inter-sentential language patterns generated from 
the intra-sentential language patterns respectively 
discovered using the extended association rule 
mining and sentence parsing. Thus, the causality 
detection method was implemented using three 
feature sets: WP, WP + ISLP ARM and 



WP + ISLP n 



where WP was used to construct a 



baseline for causality detection, while WP + ISLP ARM 
and WP + ISLP parsing were used to determine 
whether or not the newly proposed inter-sentential 
language patterns could further improve detection 
performance, and determine which method (i.e., 
extended association rule mining or sentence 
parsing) could generate intra-sentential language 
patterns more useful for subsequent inter-sentential 
language pattern mining for causality detection. 
Evaluation metrics: The metrics used for 
performance evaluation included recall, precision, 
and F -measure, respectively, defined as follows: 



Recall 



number of causality sentence pairs correctly identified by the method 
number of causality sentence pairs in the test set 



(10) 
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2 x recall x precision 

F — measure = . (12) 

recall + precision 

Evaluation of inter-sentential language pattern mining 

In inter-sentential language pattern mining, two para- 
meters may affect the quantity and quality of the discov- 
ered patterns: the size of training data and threshold 
value of confidence (Eq. (2)). The size of the training 
data set was used to control the number of documents 
used for pattern generation. The threshold value of 
confidence was used to control the number of patterns 
generated from training data. The optimal values of 
both parameters were determined by maximizing the 
performance of causality detection on the development 
set. Figure 5 shows the F-measure of causality detection 
for different proportions of training data. The results 
show that increasing the size of the training data set 
increased the performance of WP, WP + ISLP ARM , and 
WP + ISLP parsing , mainly because more useful features 
can be discovered from a larger training set. 

For the confidence threshold, a higher value represents 
a more confident pattern. In the pattern generation 
process, all discovered patterns were sorted in descend- 
ing order of their confidence values. A threshold per- 
centage was then applied to select the top N percent 
of patterns for causality detection. Figure 6 shows the 
F-measure of causality detection for different percen- 
tages of selected patterns. The results show that for 
WP + ISLP ARM performance increased as the threshold 




WP 



10 20 30 40 50 60 70 80 90 100 
Training size (%) 

Figure 5 Performance against different proportions of 
training data. 

\ J 



Page 8 of 10 



(ii) 



value increased to 0.3, indicating that the top 30 % of 
patterns were useful for detecting causality due to their 
higher level of confidence. When the threshold value 
exceeded 0.3, the performance decreased because the 
lower ranks contained more noisy patterns that tended 
to increase ambiguity in causality detection. For WP + 
ISLP parsing , the optimal threshold value was 0.7. 

Results of causality detection 

This section presents the comparative results of using 
different feature sets for causality detection. The 
results presented in Table 2 were obtained from the 
test set with 10-fold cross validation, using the optimal 
parameter settings selected in the previous section. A 
paired, two-tailed £-test was used to determine 
whether the performance difference was statistically 
significant. 

The row labeled WP indicates that it used word pairs 
alone as features, providing a baseline result for causality 
detection. Once inter-sentential language patterns were 
used, both WP + ISLP Parsing and WP + ISLP ARM im- 
proved the recall, precision, and F-measure over WP, in- 
dicating that the proposed inter-sentential language 
patterns are significant features for causality detection. 
As listed in Table 3, the inter-sentential language pat- 
terns are more semantically complete and can provide 
more precise information because they can capture the 
associations of multiple words within and between sen- 
tences. Conversely, word pairs such as (friend, energy) 




10 20 30 40 50 60 70 80 90 100 
Confidence threshold (%) 

Figure 6 Performance against different threshold values 
of confidence. 

V J 



. . number of causality sentence pairs correctly identified by the method 

Precision — . 

number of causality sentence pairs identified by the method 
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Table 2 Comparative results of causality detection with 
different feature sets 





Recall 


Precision 


F-measure 


WP 


58.60 % 


63.38 % 


60.83 % 


WP + ISLP Parsing 


61.11 % + 


64.51 % 


62.71 % + 


WP + ISLP ARM 


60.15 %* 


66.54 %* 


63.14%* 



+ ISLP Parsing vs WP significantly different (p < 0.05). 
* ISLP ARM vs WP significantly different (p < 0.05). 



and (investment, life), which consider only individual 
word relationships, are usually semantically incomplete 
and ambiguous, thus yielding lower performance. Both 
WP + ISLP Parsing and WP + ISLP ARM achieved a similar 
F-measure, indicating that both extended association 
rule mining or sentence parsing can generate intra- 
sentential language patterns that are useful for subse- 
quent inter-sentential language pattern mining for 
causality detection. 

Conclusions 

This study proposes the use of inter-sentential language 
patterns to detect cause-effect relations in online psychi- 
atric texts. We also present a text mining framework to 
mine inter-sentential language patterns by associating 
frequently co-occurring language patterns across the 
sentence boundary. Experimental results show that using 
the proposed inter-sentential language patterns im- 
proved the performance above the use of word pairs 
alone, mainly because the inter-sentential language pat- 
terns are semantically more complete and can thus pro- 
vide more precise information for causality detection. 
Future work will be devoted to investigating more useful 
cross-sentence features and information fusion methods 
to further improve system performance. 

Endnotes 

a http://www.depressionforums.org/forums. 
b http://www.psychpark.org. 
c http://www.social-anxiety.org.uk. 
d http://www. webmd.com. 
e http://answers.yahoo.com. 

Table 3 Examples of inter-sentential language patterns 

No. Inter-sentential language patterns 



Cause Effect 



1 


<<friend, argue>, 


<lose, energy>> 


2 


< investment, setback>, 


< life, trouble>> 


3 


<<performance, decrease>, 


<change, job>> 


4 


<<parents, divorce>, 


<family, break>> 


5 


<child, sick>, 


<feeling, upset>> 
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